Using syntactic dependency - pairs con ationto improve retrieval performance in Spanish ?
نویسنده
چکیده
This article presents two new approaches for term indexing which are particularly appropriate for languages with a rich lexis and morphology, such as Spanish, and need few resources to be applied. At word level, productive derivational morphology is used to connate semantically related words. At sentence level, an approximate grammar is used to connate syntactic and morphosyntactic variants of a given multi-word term into a common base form. Experimental results show remarkable improvements with regard to classical indexing methods.
منابع مشابه
Using Syntactic Dependency-Pairs Conflation to Improve Retrieval Performance in Spanish
This article presents two new approaches for term indexing which are particularly appropriate for languages with a rich lexis and morphology, such as Spanish, and need few resources to be applied. At word level, productive derivational morphology is used to conflate semantically related words. At sentence level, an approximate grammar is used to conflate syntactic and morphosyntactic variants o...
متن کاملOn the Usefulness of Extracting Syntactic Dependencies for Text Indexing
In recent years, there has been a considerable amount of interest in using Natural Language Processing in Information Retrieval research, with specific implementations varying from the word-level morphological analysis to syntactic parsing to conceptual-level semantic analysis. In particular, different degrees of phrase-level syntactic information have been incorporated in information retrieval...
متن کاملTowards the development of heuristics
In this paper we study the performance of linguistically-motivated connation techniques for Information Retrieval in Spanish. In particular, we have studied the application of productive derivational morphology for single word term connation and the extraction of syntactic dependency pairs for multi-word term connation. These techniques have been tested on several search engines implementing di...
متن کاملResolving prepositional phrase attachment ambiguities in Spanish with a classifier ∗ Resolviendo las ambigüedades de adjunción de sintagmas preposicionales en castellano con un clasificador
In this paper we present a classifier that solves a certain kind of ambiguities in syntactic structure for Spanish, namely, ambiguities as to the point of adjunction of a prepositional phrase in the syntactic structure of a sentence (PP attachment). As a starting point, we used EsTxala dependency grammar for Spanish, integrated within FreeLing, with an accuracy score of 61 % on PP adjunction. O...
متن کاملTowards the Development of Heuristics for Automatic Query Expansion
In this paper we study the performance of linguisticallymotivated conflation techniques for Information Retrieval in Spanish. In particular, we have studied the application of productive derivational morphology for single word term conflation and the extraction of syntactic dependency pairs for multi-word term conflation. These techniques have been tested on several search engines implementing ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002